A Novel Online Encyclopedia-Oriented Approach for Large-Scale Knowledge Base Construction

Authors

  • Ting Wang
  • Ruihua Di
  • Jicheng Song
Abstract

In constructing a large-scale knowledge base, manual construction lacks both efficiency and flexibility. Automatically extracting massive amounts of knowledge from online encyclopedias has therefore attracted attention from an increasing number of scholars. Current research focuses mainly on extracting data from English online encyclopedias, whereas research on knowledge extraction from Chinese or other-language data sources is rare. For this reason, this paper proposes an automatic construction scheme for a large-scale knowledge base built on a Chinese online encyclopedia. (i) In the first phase of the scheme, self-expanded learning is performed on the semantic relations between subjects and objects in the knowledge triples. (ii) In the second phase, the semantic relations between the marked attributes and their entities are predicted using Conditional Random Fields (CRFs) and a support vector machine (SVM) classifier. A large-scale knowledge base is automatically constructed with the scheme, and the experimental results indicate that the scheme is feasible and effective.
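For readers unfamiliar with the term, a knowledge triple pairs a subject and an object through a relation. The sketch below is only an illustration of that data structure, not the authors' extraction pipeline: the names `Triple` and `infobox_to_triples`, and the sample entry, are hypothetical and assume that an encyclopedia entry exposes its attributes as simple key/value pairs.

```python
from typing import Dict, List, NamedTuple


class Triple(NamedTuple):
    """One knowledge triple: (subject, predicate, object). Hypothetical type."""
    subject: str
    predicate: str
    obj: str


def infobox_to_triples(entity: str, infobox: Dict[str, str]) -> List[Triple]:
    """Turn the attribute/value pairs of one entry into triples.

    Assumption: attributes arrive as plain strings; empty values are skipped.
    """
    triples = []
    for attribute, value in infobox.items():
        value = value.strip()
        if value:  # ignore attributes that have no usable value
            triples.append(Triple(entity, attribute.strip(), value))
    return triples


# Illustrative input only; not data from the paper.
example = infobox_to_triples(
    "Beijing",
    {"country": "China", "population": " ", "area": "16,410 km2"},
)
for triple in example:
    print(triple)
```

In a scheme like the one described, predicting which relation links an attribute to its entity would be a separate classification step; this sketch covers only the representation.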


Similar resources

A Proposal for a Gene Functions Wiki

Large knowledge bases integrating different domains can provide a foundation for new applications in biology such as data mining or automated reasoning. The traditional approach to the construction of such knowledge bases is manual and therefore extremely time consuming. The ubiquity of the internet now makes large-scale community collaboration for the construction of knowledge bases, such as t...

The effect of language complexity and group size on knowledge construction: Implications for online learning

This study investigated the effect of language complexity and group size on knowledge construction in two online debates. Knowledge construction was assessed using Gunawardena et al.’s Interaction Analysis Model (1997). Language complexity was determined by dividing the number of unique words by total words. It refers to the lexical variation. The results showed that...

Online Aggregation of Coherent Generators Based on Electrical Parameters of Synchronous Generators

This paper proposes a novel approach for online clustering of coherent generators in a large power system following a wide-area disturbance. An interconnected power system may become unstable after a severe contingency when it is operated close to its stability boundaries. Hence, controlled islanding of the bulk power system is the last resort to prevent catastrophic cascading outages and wide area bl...

Towards Automatic Construction of Knowledge Bases from Chinese Online Resources

Automatically constructing knowledge bases from online resources has become a crucial task in many research areas. Most existing knowledge bases are built from English resources, while few efforts have been made for other languages. Building knowledge bases for Chinese is of great importance in its own right. However, simply adapting existing tools from English to Chinese yields inferior result...

A Convenient Base-Mediated Diastereoselective Synthesis of 2-Oxo-N,4,6-triarylcyclohex-3-enecarboxamides via Claisen-Schmidt Condensation

The sodium acetate-catalyzed multi-component reaction of acetophenone, aromatic aldehydes, and acetoacetanilide in a water-ethanol mixture (1:1) at ambient temperature via Claisen-Schmidt condensation results in the formation of highly substituted cyclohexenones in 89–98% yields. The developed efficient catalytic approach to the substituted cyclohexenones – the promising ...

Journal title:
  • JSW

Volume 9  Issue 

Pages  -

Publication date 2014